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Abstract 

Bit-interleaved coded modulation (BICM) transceivers often use equally spaced constellations and 
a random interleaver In this paper, we propose a new BICM design, which considers hierarchical 
(nonequally spaced) constellations, a bit-level multiplexer, and multiple interleavers. It is shown that this 
new scheme increases the degrees of freedom that can be exploited in order to improve its performance. 
Analytical bounds on the bit error rate (BER) of the system in terms of the constellation parameters and 
the multiplexing rules are developed for the additive white Gaussian Noise (AWGN) and Nakagami-m 
fading channels. These bounds are then used to design the BICM transceiver. Numerical results show 
that, compared to conventional BICM designs, and for a target BER of 10^®, gains up to 3 dB in the 
AWGN channel are obtained. For fading channels, the gains depend on the fading parameter, and reach 
2 dB for a target BER of 10 ^'^ and m = 5. 
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I. Introduction 

Bit-interleaved coded modulation (BICM) |[I|-|l3l is used in most of the existing wireless 
communication standards, e.g., HSPA, IEEE 802.1 la/g/n, DVB, etc. In BICM, the channel 
encoder and the modulator are separated by a bit-level interleaver which allows the designer to 
choose the code rate and the constellation independently. BICM maximizes the code diversity, 
and therefore, outperforms trellis coded modulation (TCM) in fading channels. Compared to 
TCM, BICM is suboptimal for the additive white Gaussian noise (AWGN) channel because it 
decreases the minimum Euclidean distance. Nevertheless, its simplicity and flexibility make it 
an attractive coded modulation scheme even when fading is not present. 

BICM appears as a simple out-of-the box coded modulation scheme, however, its full potential 
is achieved only if its design is optimized. It has been shown in BU that the interleaver and the 
code can be jointly designed to exploit the so-called unequal error protection (UEP) caused by 
the binary labeling of the equally spaced (ES) constellations. In general, if the channel offers 
UEP to the coded bits, gains in terms of bit error rate (BER) can be obtained, cf. [jH Sec. I] and 
references therein. UEP can also be intentionally introduced when designing the transceiver. For 
example, UEP can be imposed by allowing unequal power allocation for different bits, by deleting 
bits using certain patterns (puncturing), by changing the binary labeling of the constellation, or 
by using signal shaping, i.e., by using nonequally spaced (NES) constellations or nonequally 
likely symbols, known as geometrical and probabilistic shaping, respectively. 

In this paper, we propose to control the UEP via geometrical shaping. In particular, we use the 
so-called hierarchical constellations [5], which have received a great deal of attention in many 
applications where independent data streams with different qualities must be sent at the same 
time, e.g., in multi-resolution image transmission [[6l, [|71, and simultaneous voice and multi-class 
data transmission Hierarchical quadrature amplitude modulation (HQAM) constellations are 
also used in QUALCOMM's MediaFLO ^ and have been standardized for the latest digital 
video broadcasting-terrestrial (DVB-T2) ifTOll . [fTTIl . In this paper, we use HQAM constellations 
in a different context, i.e., to transmit only one data stream with improved error performance. 

Although shaping techniques for BICM have received some attention in the literature (e.g., ge- 
ometrical in |[T2l - lfT4l and probabilistic in lUSl . ffT6l Sec. III-F]), they are all based on capacity 
maximization, which translates into enhanced performance if capacity-approaching codes (turbo 
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or low-density parity-check codes) are used. On the other hand, capacity arguments are less 
relevant when convolutionally-encoded BICM is considered. In this paper, we study this simple 
and low-complexity BICM configuration, and thus, we adopt a different approach that aims at 
the minimization of the BER for a given signal to noise ratio (SNR). 

In order to exploit the UEP offered by the constellation, the interleaver must be properly 
designed. The most commonly interleaver considered in the literature is the single interleaver 
(S-interleaver) of [[3, which eliminates the UEP caused by the binary labeling^. Recently, the 
the so-called multiple interleavers (M-interleavers) flU were shown to improve the performance 
of the system by exploiting the UEP caused by the modulator. In fact, the use of BICM with 
M-interleavers (BICM-M) corresponds to the original BICM configuration proposed by Zehavi 
in [[ll, as well as the original BICM with iterative decoding (BICM-ID) scheme proposed by 
Li and Ritcey in iflTl . M-interleavers have also been shown to outperform S -interleavers when 
BICM-ID is considered |[T8||. 

The BICM-M system in flU, [[T8l| uses a random bit-level multiplexing (R-MUX) that connects 
the encoder and the M-interleavers, i.e., the M-interleavers assign the coded bits to a particular 
bit position in the modulator in a pseudo-random fashion (with predetermined probabilities). 
By doing this, the dependency of adjacent coded bits is ignored. In this paper, we propose an 
multiplexing/interleaving inspired by the well-known puncturing strategy based on the periodic 
elimination of the bits according to a prescribed pattern that matches the temporal structure of 
the code, cf. [[T9ll . We show that such a deterministic multiplexing (D-MUX) of the coded bits 
(followed by random interleaving) notably outperforms the R-MUX used in flU. 

The contributions of this paper can be summarized as follows. We propose and study a BICM 
scheme for fading and nonfading channels which considers the use of HQAM constellations 
(HQAM-BICM), a periodic (and deterministic) bit-level multiplexer, and M-interleavers. It is 
demonstrated that the degrees of freedom of such a scheme can be exploited to notably improve 
performance of the system in terms of BER. We use a Gaussian model for the probability 
density function (PDF) of the L-values passed to the decoder which consider the nonequal 
spacing between the signal points in the constellation, and apply it to develop union bounds 
(UB) on the BER of the system for fading and nonfading channels. These UBs are then used 

'UEP in BICM was in fact considered an "undesired feature" in (2] Sec. II]. 
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to optimize the transceiver. Presented numerical examples show that the proposed system offers 
gains over previous BICM configurations (ES-QAM and S-interleavers |l2l or ES-QAM and M- 
interleavers [jH). For the particular cases analyzed in this paper, the gains can be up to 3 dB for 
a BER target of 10"^ in the AWGN channel, and for the Nakagami-m fading channel, the gains 
can reach 2 dB for a target of 10^'' and m = 5. 

II. Proposed BICM Transceiver 

Throughout this paper, we use boldface letters Ct = [ci t, . . . , c^^t] to denote row vectors and 
capital boldface letters C = [cj, . . . , cjj]'^ to denote a matrix of M rows, where (■)^ denotes 
transposition. We denote probability by Pr(-) and the PDF of a random variable X by 'px{x). A 
Gaussian distribution with mean value /i and variance cr^ is denoted by A/'(/i, cr^), the Gaussian 
PDF with the same parameters by ip{X;fi,a) = -^=-exp{— ^^~J:^^ ), and the Q-function by 
Q{x) = exp combinations of i nonegative integers such that their sum 

is / is denoted by >Vi(/), where >Vi(/) = {[wi, ...,Wi] E (Z+)* : wi + . . . + Wi = I}. 

The HQAM-BICM system model under consideration is shown in Fig. [U In what follows, 
we describe functionalities of various blocks of such transmission scheme. 

A. Encoder, Multiplexing, and Interleaving 

The kc vectors of information bits ii = [i/ 1, . . . , i; jvj with I = 1, . . . , kc are encoded by a rate 
R = kc/n convolutional encoder (ENC) yielding the vectors of coded bits Cp = [cp i, . . . , Cp atJ 
with p = l,...,n. These are then fed to a deterministic multiplexing (D-MUX) unit which 
bijectively maps C = [cj, c^]T onto O = [oj, oj]'^ with = [ok,i, Ok,N,] and 
k = 1, . . . ,q. Without loss of generality, we assume N^q = NcU. The vector of bits after the 
D-MUX are fed to q parallel interleavers tt^. The q interleavers are assumed to be independent 
and give randomly permuted sequences of the bits, i.e., Uk = 7rk{ok}. Each of the interleavers is 
connected to the gth bit positions in the hierarchical M-ary pulse amplitude modulation (HPAM) 
constellation, where q = log2 M. 

In general, the D-MUX can be defined as a one-to-one mapping between the blocks of nNc 
and qNs bits, i.e., {0, l}"^': ^ {0, 1}^^=. We define it via an n x Nc matrix K, whose {p,t')th 
entry is a pair (k, t) where k E {1, . . . , g} and t E {1, . . . , A'^^}. The entry (k, t) indicates that 
the bit Cp t/ is assigned to the kth D-MUX's output at time instant t, i.e., Ok,t = Cp^t'- 
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The previous definition of the D-MUX is entirely general but difficult to deal with, and thus, 
in this paper we only consider D-MUX configurations that operate periodically over blocks of 
nJ bits. We then represent ^ as a concatenation of N^/ J matrices K^., each of dimensions 
n X J, i.e., K = [Kq, . . . , Kp^^/j^i], where the variable J is called the period of the D-MUX. 
The entries of K^^ are pairs {k, t + rnJ/q) with k E {1, . . . , q} and t G {1, . . . , nJ/q}. Without 
loss of generality, we assume that (A'c mod J) = and that (nJ mod q) = 0. To clarify these 
definitions, consider the following example. 

Example 1: Assume k^ = I and n = 2 (i? = 1/2), J = 3, and g = 3 (8-ary constellation). 
One possible D-MUX is defined by 

(l,l + 2r) (2,2 + 2r) (2, 1 + 2r) 
(l,2 + 2r) (3,2 + 2r) (3, 1 + 2r) 



(1) 



which results in 

'(1,1) (2,2) (2,1) 
(1,2) (3,2) (3,1) 
The mapping between C and O is then 



K 



(1.3) (2,4) (2,3) 

(1.4) (3,4) (3,3) 



Cl,l Ci,2 Ci,3 
C2,l C2,2 C2,3 



Cl,4 Ci,5 Ci,6 
C2,4 C2,5 C2fi 



o 



Cl,l 


C2,l 


Cl,4 


C2,4 




Cl,3 


Cl,2 


Cl,6 


Cl,5 




C2,3 


C2,2 


C2,6 


C2,5 





(2) 



Since a matrix is simply a permutation of the set {l,...,g} x {1, . . . , nJ/g}, {nJ)\ 
different matrices K^. can be generated. However, this number can be reduced since trivial 
operations that do not affect the performance of the system can be applied to Kr- For example, 
for the matrix in ([T]) with r = 0, consider the following two matrices: 



K' 



(1,2) (2,2) (2,1) 
(1,1) (3,1) (3,2) 



K" 

-"■0 



(2,1) (1,1) (2,2) 
(3,1) (1,2) (3,2) 



The matrix K'^^ is obtained by permuting the elements of Kq such that the first elements in the 
entries of Kq are not altered. Because M-interleavers are used after the D-MUX (cf. Fig. [I]), the 
temporal structure of the sequences is randomized, and thus, the second elements of the entries 
are not (which determines to which time bit Ck^t is assigned) are not relevant. Consequently, the 
performance of the system using Kq or K'q will be the same. The matrix Kq is obtained by 
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cyclically rotating the columns of the matrix Kq, which will produce K with columns shifted 
to the left or right. For long coded sequences, i.e., J <^ Nc, the original matrix and its shifted 
version will yield the same performance. 

B. HPAM Constellations 

In this paper, we consider HQAM constellations labeled by the binary reflected Gray code 
(BRGC) EOl presented in dH. In HQAM constellations, each symbol is a superposition of 
independently modulated real/imaginary parts, which allows us to focus on the equivalent HPAM 
constellation, cf. Fig. [U At any time instant t, the coded and interleaved bits [ui . . . , Ug^t] are 
mapped to an HPAM symbol x^{t) E X = {x\, . . . , x\j_{\ using a binary memoryless mapping 
: {0, 1}^ — 7- X. Since the mapper is memoryless, from now on we drop the time index t. 

We analyze HPAM constellations as the one shown in Fig. [2] (M = 8), which are defined by 
the distances dk with k = 1, . . . ,g. In this figure, the M constellation points are shown with 
black circles, where the white squares/triangles are "virtual" symbols that help to understand 
the construction of the HPAM constellation as explained below. We use A; = 1, . . . , g to denote 
the bit position of the binary labeling, where A; = 1 represents the left most bit position. The bit 
value of A; = 1 selects one of the two squares in Fig. [21 Similarly, for a given value of the first 
bit, the bit value for the next position (k = 2) selects one of the two triangles that surround the 
previously selected square. Finally, given the bit values for A; = 1 and k = 2, the bit value of 
bit position k = 3 selects one of the two black symbols that surround the previously selected 
triangle. This selected symbol (black circle) is finally transmitted by the modulator. 

We denote the base-2 representation of the integer < j < Af — 1 by the vector b(j) = 
[bi{j), . . . , bq{j)], where &i(j) is the most significant bit of j and bq{j) the least significant. This 
allows us to express the elements E X of the HPAM constellation as 



4 = E(-i)''^'^"'^'^- 

k=l 

We also define the normalized constellation parameters as 

A dk+1 

ak = —J—, (4) 

with k = 1, . . . , q — 1. Using ([3]), and for equiprobable symbol transmission, the average symbol 
energy is given by Es = {1 + J2k=i '^D^i with at is given by dH). Throughout this paper. 



December 9, 2010 



DRAFT 



7 



we consider that the constellation is normalized to have unit energy, which translates into the 
relation di = {aj + + . . . + a'^_^ + 

If the constellation points move freely, it is possible that they cross each other (by having for 
example ^3 < in Fig. [21), and therefore, the binary labeling is not the BRGC anymore. Since 
in this paper we restrict the analysis to the BRGC, extra constraints on the values of must 
be added, namely, 

g-l g-1 

ttfc > ^ aj, ^afc < 1, and Og^i > 0, (5) 

j=k+l k=l 

where k = — 1. The inequalities in ^ are found by solving ( ) > with 

j=0,...,M-2. 

Example 2 (Constellation parameters for M = 8): For M = 8, the constellation optimization 
space is formed by two variables, ai = 3.2/ di and 02 = d^/di, cf. ([4]). From ([5]) we have the 
following constrains ai > 02, ai + 02 < 1, and 02 > 0, which result in a pair of constellation 
parameters (ai, 02) shown in Fig. [3l In this figure, the evolution of the constellation for different 
values of (01,02) are shown; the shadowed region represents the values of (ai,Q!2) that give a 
BRGC-labeled constellation. Particularly important cases are the equally spaced 2-R^.M, 4-PAM, 
and 8-PAM constellations. 

The result of the transmission of a complex symbol x = + jx'^ is given hy y = hx + z, 
where h = h} + jh^ is the complex channel gain, and 2; is a complex Gaussian noise with 
zero mean and variance Nq/2 in each dimension. The amplitude of the channel gain \h\ follows 
a Nakagami-m distribution, and thus, the instantaneous SNR, defined as 7 = j^, follows a 
Gamma distribution, i.e., 

where G(m) is the Gamma function, 7 = Er[7] is the average SNR, and T is the random 
variable that represents the instantaneous SNR. The AWGN channel is obtained when \h\ = 1. 

At the receiver's side, the real part of the received signal is normalized by the channel gain 
h (y^ = ^{y/h}) and passed to the demapper, which computes logarithmic likelihood ratios 
(L-values) for each bit in the transmitted symbol. The A;th L-value given the transmitted symbol 
Xj and the channel gain h (or equivalently, 7) can be written as [Hl-Hl 



Pr(Mfc = 


o\y\i) 


Pr(Mfc = 





h{y \xj, 7) = log TT7 , I t' N ~ 7 



mill {(y^ - aY] - min {(y^ - af} 



(7) 
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where X^ i, is the set of symbols labeled with the kth bit equal to b, and where we have used 
the so-called max-log approximation HI, JH, ETI . 

The vectors of L-values calculated by the demapper are then deinterleaved, generating the 
sequence Ik = T^k^ih) with k = l,...,g, cf. Fig. \T\ These L-values are reorganized by the 
demultiplexer unit (DEMUX), defined as /™^/ = which simply inverts the process done by 
the D-MUX at the transmitter. Finally, these L-values are passed to the the channel decoder 
which produces an estimate of the transmitted bits. In this paper, we consider convolutional 
codes and a soft-input Viterbi decoder. 

in. Equivalent Channel Model 

In order to predict the coded BER performance of the system, finding the PDF of the L-values 
passed to the channel decoder is crucial. In what follows, we develop closed-form expressions for 
the PDF of L-values for HQAM-BICM transmission as a function of the constellation parameters. 
These expressions will later be used to compute bounds on the BER of the systems, and then, 
used to optimize the design of the system. From now on, all the analysis is made for the 
constituent HPAM constellation (cf. Fig. [T]), and thus, with a slight abuse of notation, we use x 
and y to denote the real part of the transmitted symbol and the real part of the received signal, 
respectively, i.e., we skip the superscript (■)^. 

The use of the max-log approximation in ([7]) transforms the nonlinear relation between the 
received signal and the L-values into a piecewise linear relation. Examples of this piece-wise 
linear relation can be found in literature, see for example [[22l Fig. 3], [[231 Fig. 2, Fig. 4], [[24l 
Fig. 3], or ll25l Table I]. This piece- wise linear relation has been used to develop expressions 
for the PDF of the L-values in (|7]) using arbitrary signal sets in ll26l (based on an algorithmic 
approach), closed-form expressions for ES-QAM constellations labeled with the BRGC for the 
AWGN channel in ll22l . and for fading channels in ll27l . Recently, closed-form approximations 
for the PDF of the L-values for arbitrary signal sets and binary labeling in fading channels have 
been developed in ll28l . 

For a given transmitted symbol Xj and SNR 7, F ~ J\f{xj, 1/(27)). Using a generalization 
of the so-called consistent model (CoMod) introduced in [[22|. the PDF of the L-values in ^ 
can be approximated by 

Lkixj] 7) ~ A/'(7/^fc,i, l^k,j)^ (8) 
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where 

/ifcj = (-l)^^^'^'j(xfcj - Xjf, al j = 2{xk,j - Xjf = 2|/ifcj|, (9) 

bj = [bij, . . . ,bqj] is the binary label of the symbol Xj, and Xkj is the closest symbol to Xj 
with the opposite bit value at bit position k. The model in ^ fulfills the consistency condition 
{alj = 2|yLifc and thus, the Gaussian distribution in ([8]) is completely determined by jjk,j- 

The results in ([S])-® can be considered as a particular case of the results in ll28l "Case 1", 
Fig. 2, Tab. 1]). In what follows, we will develop generic expressions for jjkj in dH) in terms of 
the distances defining the HPAM constellation. 

In Table HI we present the values of fi^j in (|9l) as a function of the constellation distances 
for M = 8. These values are obtained by direct inspection of Fig. [2l From this table we can see 
that the symmetry of the constellation is reflected in the mean values. For example, for k = 1, 
/ii 3 = — /ii,4, yUi,2 = — /^i,5> /^i,i = ~/^i,6j ^^d /ii q = —fJ'1,7- If wc aualyzc the variances, cf. (|9l), 
we note that for A; = 1 there are 4 different variances, for k = 2 two different variances, and for 
k = 3 only one. This idea can be generalized, i.e., from Table H it is possible to infer that for 
a given k, there are = || different variances, which are determined by the first values 
of j. This idea was previously used in [[22|. 

The performance evaluation in Sec. |lV]is based on the transmission of the all-zero sequence, 
and thus, here we only need to consider positive values of fif,j (from ni^j > if b^j = 0). 
From the evolution of ^kj in Table H we can write a generic closed-form expression for jj^j 
(for any M) in terms of constellation distance parameters as follows 

ldk,j = 4:idk- ^ 6fc/_fc(j)4'j , (10) 
^ k'=k+i ' 

where j = 0, 1, . . . , - 1, k = 1,2, and b{j) = [bq{j), . . . ,bi{j)] is the binary 

representation of the integer j where bi{j) is the least significant bit. 

Using the approximation for the PDF of the L-values given in ([8]), it is possible to build an 

equivalent model for the BICM channel shown in Fig. [T] [|22ll . [|28l . [|29ll . This model considers 

Mfc virtual channels, each of them determined by fj^kj in (flOl) . A given bit Ok^t = can be 

transmitted through the jth virtual channel with a probability given by = Then, the 

PDF of the L-values at the output of the kth interleaver can be expressed as a Gaussian 
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mixture with density given by 

Mfe-l M/2^-l 

Pl,{>^;i) = ^k,j'ip{^;if^kj,'2^f^k,j) = ip{X;'JHk,j,2'jfik,j)- (11) 

j=0 j=0 

IV. Performance Analysis 

In this section, we develop union bounds (UBs) on the BER of the HQAM-BICM system 
proposed in Sec. |ll] using the PDF of the L-values developed in Sec. Hill 

A. Union Bound 

We define a remerging sequence as a path in the trellis of the code (ENC) that leaves the zero 
state and remerge with it after certain number of trellis stages. The ENC and the D-MUX are 
grouped into an "equivalent code" (as shown in Fig.[T]) and characterized by an equivalent weight 
distribution spectrum (EWDS) f3K{w) with w = [wi, . . . ,Wq] E (Z+)''o We use the notation 
/3k (it^) to emphasize that the EWDS depends on the D-MUX configuration determined by K. 
This EWDS counts the Hamming weights of all the input sequences that generate remerging 
sequences with weight w at the D-MUX's output. 

Using the previous definitions, we can express the (truncated) UB on the BER as 

1 

BER<UB^— Y f3K{w)PEP{w;^), (12) 

where PEP(if;;7) is the pairwise error probability which represents the probability that the 
decoder selects a codeword with weight w instead of the transmitted all-zero codeword. The 
PEP can be expressed in terms of the decision variable D(w) as 

PEP{w;^) = Fi{D{w) > 0}, (13) 

where 

Wl Wq 

1=1 1=1 

and where L^*^ are independent samples of the random variables representing the L-values whose 
PDF is given by (fTTI) . In the following subsections, we will show how to compute (3k{w) and 
PEP(iu;7) required to evaluate the UB in (fT2)) . 

^To alleviate the notation, from now on we will refer to the matrix Kt as K. 
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B. Equivalent Weight Distribution Spectrum 

The vector w corresponds to the Hamming weights of the rows of the matrix O generated 
by remerging sequences represented by the matrix C. The correspondence between C and O 
is determined by the matrix K (which defines the D-MUX) as well as by the time at which the 
remerging sequence starts to diverge. However, due to the periodic structure of K, only J time 
instants must be considered. Based on this, the EWDS can be expressed as 



1 J 



j=l 



(15) 



where (3^^{w) represents the EWDS when the decoder starts to diverge at time t + j with 
arbitrary t. We note that the similarities between the EWDS in (fT5l) and the computation of the 
WDS of punctured convolutional codes |fT9l . The following example clarifies the main principle 
behind (fTSl) . while more details can be found in lfT9l Sec. II-B]. 

Example 3 (EWDS of the code (5, Consider the constraint length K = 3 convolutional 
code with polynomial generators (5, 7)8, with w^^^'^ = 5 and the D-MUX in Example [TJ For this 
code, there is one divergent sequence generated by an input sequence with Hamming weight one 



nd output weight The J = 3 possible input sequences are i\ 



(1) 



...,0,1,0,0,0,0,...], 



.-(2) 



[...,0,0,1,0,0,0,...], and if^ = [ 



,0,0,0,1,0,0, 



], which result in the following 



matrices C 



.01010 
.01110 



...00101... 
...00111... 



... 1 1 ... 
... 1 1 1 ... 
which by using Q yield 

...110... ...0011... ...00010... 

O^^) = ...100... , O^^) = ...0100... , O^^) = ...10010... 

...110... ...1100... ...10010... 

If we consider only this event at minimum Hamming distance (with input weight one), the final 
EWDS given by (fTS]) is obtained by computing the Hamming weights of the rows of O^^^ with 
J = 1,2,3, i.e.. 




[2,1,2] 
[1,2,2] 
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The spectrum ^Kiw) can be numerically calculated using a breadth-first search algorithm [|30l . 
Clearly, the spectrum must be truncated so that only diverging sequences with total Hamming 
weight wi + . . . Wq < w are considered, cf. (fT2l) . 

C. Computation of PEP (w;^) 

A common approach for computing the PEP in (fT3l) is through the use of the Laplace transform 
of the PDF of the decision variable (see for example ll28l and the references therein). However, 
due to its simplicity and accuracy, the saddlepoint approximation (SPA) OTl has recently attracted 
considerable interest. In this subsection, we use a generalization of the PEP computation based 
on the SPA used in [|27ll . Il28l . ll3T1l . and we apply it to BICM systems based on M-interleavers 
and HQAM constellations. 

Let $Lfe (s; 7) be the two-sided Laplace transform of the PDF of the L-value in (fTTI) . and 
let $'^^(s;7) and $'^^(s;7) be its first and second derivative with respect to s, respectively. Let 
also denote the so-called saddlepoint by s, where s is the solution of $'^^ (5; 7) = 0. 

Theorem 1: The PEP in (fT3l) can be approximated using the SPA as 



The PEP in Theorem [Hallows us to compute UBs on the BER for the proposed HQAM-BICM 
for the AWGN and Nakagami-m fading channels, as stated in the following two theorems. 
Theorem 2: The UB for HQAM-BICM for the AWGN channel (7 = 7) using the SPA is 




1 -1/2 




(16) 



k=l 



Proof: The proof is given in Appendix lAl 




A/fc-l 



1 -1/2 



q [Mfe-l 



n X] ^'^J (-/^fc,i7/4) 



(17) 



k=i L i=o 



Proof: The proof is given in Appendix |Bl 
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Theorem 3: The UB for HQAM-BICM for the Nakagami-m fading channel using the SPA is 



UB 



1 



T^l / Wk 7 

^ S^Mk-l ^ ( 4m 



(m.+l) 



-1/2 



n 

fe=i 



■A4-1 



4m 



4m + 



(18) 



Proof: The proof is given in Appendix O ■ 
The expressions in Theorems [2] and [3] explicitly show the mean values /i^j (which depend 
on the constellation parameters) and the EWDS of the code, and thus, they can be used to 
optimize the performance of the system. Moreover, we recognize that the PEP computation in 
(fT3l) for both AWGN and Nakagami-m fading channels can be done directly using the PDF of 
the L-values in (fTTI) (cf. [41 for the AWGN channel). Nevertheless, we used the SPA because it 
results in ready-to-use formulas, cf. (fTTI) and (fT8l) . 

We conclude this section by noting that the BICM models in [|4l and ^ can be regarded 
as a particular cases of the model we introduced in this paper. The model in [4| can be 
obtained using our model li J ^ Nc and the first elements of the entries (A;, t) of K, which 
represent the assignement of the coded bits to a particular interleaver, are randomly selected with 
predetermined probabilities. The BICM with S-interleavers (BICM-S) configuration of [2J can 
be obtained by letting J ^ and by selecting a matrix K with elements randomly permuted. 
By doing this, we assure that the coded bits are uniformly assigned over time and also over the 
bit positions. 



V. Numerical Results 

In this section, we present numerical examples that illustrate the gains that can be obtained 
by using an optimized HQAM-BICM system. In particular, we analyze two practically relevant 
spectral efficiencies: 1 bit/dimension and 1.5 bit/dimension. We use a rate R = 1/2 optimum 
distance spectrum convolutional code with constraint length K = and generator polynomial 
(5, 7)3. The decoding is based on the soft-input Viterbi algorithm without memory truncation, 
and the block length used for simulation is Nc = 24000. The optimization of the UB was carried 
out numerically via an exhaustive search over the valid range of constellation parameter(s) with 
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a step size of 0.01. In the following subsections, we use the names 2-PAM, 4-PAM, and 8-PAM 
to refer to the ES-PAM constellations. 

A. Spectral efficiency 1 bit/dimension 

For this particular case {n = q = 2) there is only one constellation parameter, i.e., ai. 
From Fig. |3] (with a2 = 0), we observe three cases of particular interest: ai = gives a 2-PAM 
constellation, ai = 1/2 gives a 4-PAM constellation, and ai = 1 gives a three-point constellation. 
We consider a D-MUX with period J = 2 which result in only four different matrices: 

(1.1) (2,1)1 

(1.2) (2,2) J ' 

(2,1) (2,2) 
(1,1) (1,2) _ ■ 

For this specific example, the D-MUX configurations given by K^^\ and K^^^ are identical to 
the two possible R-MUX configurations i.e., when all the coded bits from one encoder's 
output are assigned to one modulator's input. 

For both the AWGN channel and Nakagami-m fading channels, we first study the behavior 
of the UBs given by Theorem |2] and Theorem [3l respectivelj^ as a function of the constellation 
parameter ai. The UB is shown in Fig. |4] for the four different D-MUX configurations and 
different values of 7. This figure shows that for a given constellation parameter, different D- 
MUX configurations give different BER performances. In particular, for all the 3 cases in Fig. IH 
when «! = 1/2 (4-PAM) is considered, the lowest UB performance is obtained K = K^'^'^ (but 
this changes if another value of a is chosen). This is equivalent to the R-MUX when all the bits 
from (7)3 are assigned to A; = 1 and the coded bits from {5)s to k = 2, which was shown in 
JH (only for the AWGN case). From this figure, it is also clear that the 4-PAM constellation is 
suboptimal, i.e., by selecting another value of ai, gains are obtained. More particularly, for the 
AWGN case, by changing the value of «i from cti = 1/2 to ai = 0.12, the UB decreases from 

'All the results presented in this subsection were obtained using w — 125 for the AWGN channel and w — 30 for Nakagami-m 
fading channels, cf. l ll2b . 



(1,1) (1,2) 
(2,1) (2,2) 



(2,1) 
(1,2) 



(1,1) 
(2,2) 



, and K(^) 
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UB ^ 0.8 • 10"^ to UB ^ 0.7 ■ 10"^. The gains for the Nakagami-m fading channel are smaller 
but still visible. 

The UB depends on the average SNR, the constellation parameter, the D-MUX configuration, 
and m (in fading channels). Therefore, to obtain the optimal design, one needs to jointly optimize 
K and ai for each value of 7 and m (in fading channels) that minimize the UB. Because of 
the nature of the UB, the bound is tight only for BER below certain value (typically 10"'^ or 
10~^), and thus, the optimization will be valid only for BER below this limit. From now on, we 
denote the values that minimize the UB for a given 7 by 0^(7) and K*{'^). 

We performed a numerical optimization over K and cti for m = 1, m = 5, m = 20, and for 
the AWGN channel for the values of 7 that give a BER of interest (BER < 10~^). The results 
obtained showed that in such a case, the optimum D-MUX is always given by K*{^) = K^*\ 
The values of a* (7) obtained in the optimization are shown in Fig. [5] (left). This results show 
that when the fading is severe, the optimal constellation is close to a constellation, and 

in fact does not depend much on the value of 7. On the other hand, for the AWGN channel, the 
dependency on the average SNR is notable and a constellation is far from the optimum. 

In Fig. [5] (right), we show the behavior of the UB as a function of ai for a given average SNR 
7 = 9 dB and different channel conditions. This figure shows how the value a* (7) evolves from 
aj(7) ^ 0.5 (for m = 2) to al{^) ^ 0.16 for the AWGN case. 

We conclude this subsection by presenting the BER performance obtained by using the 
proposed HQAM-BICM system, where the constellation and the D-MUX are optimized for 
each SNR. The results are presented in Fig. [6l where we also show the results obtained by the 
conventional BICM-S system of [i2], and the one studied in [4] (R-MUX), both of them using 
a 4-R^M constellation. The resu 



in this paper (for BER < 10 



'4 



ts in this figure confirm the tightness of the UBs developed 
They also confirm that a joint optimization of the D-MUX 
and the constellation outperforms the previous designs. More particularly, for a BER target of 
10~^, the obtained gains are approximately 1 dB for the AWGN channel and 0.4 dB for m = 5. 
These gains decrease when the fading in the channel increases, and for m = 1 (Rayleigh fading 
channel), they are marginal. 

''We note a slight mismatch between the simulations and the bounds for m = 1 (for the three configurations shown in Fig. [SJ. 
We conjecture this is caused by the approximation used to model the PDF of the L-values, cf. Sec. [Hi] A similar mismatch is 



observed in Fig. [S] cf. Sec. IV-B 
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K*(7) 



(19) 



B. Spectral efficiency 1.5 bit/dimension 

We consider 8-ary constellations (g = 3), which together with the rate R = 1/2 code (n = 2) 
gives a spectral efficiency of 1.5 bits/dimension. In this case, the constellation is defined by the 
pair («!, a2). From Fig. [3l we see that the 8-PAM constellation is obtained with ^2 = 1/4 and 
«! = 1/2 and that 4-ary constellations are obtained with a2 = (4-PAM with ai = 1/2). We 
consider D-MUX configurations with the shortest possible period, i.e., J = 3, for which there 
will be a total of thirty different D-MUX configurations. The UBs presented in this subsection 
are evaluated for w = 30 for both fading and the AWGN channel. 

For the AWGN channel, and an average SNR 7 [dB] e {10, 11, . . . , 15} (which give a UB 
below 10""^), we obtained the optimal D-MUX configuration K*{^) and constellation parameters 
(a^ (7), 02(7)) using Theorem |2] and an exhaustive search. The optimal matrix for all 7 [dB] E 
{10, 11, ... , 15} was found to be 

'(1,1) (2,1) (3,1) 
_(3,2) (2,2) (1,2)_ 
and the optimal constellation parameters are 

(a* (7), «;(7)) = [(0.46, 0) (0.45, 0) (0.44, 0) (0.43, 0) (0.43, 0) (0.43, 0)] (20) 

The results in (|20l) indicate that the optimal constellation is a 4-ary constellation (0^2 = 0) with 
«! ~ 0.45, which translates into a system where the third output of the D-MUX (cf. (fT9l )) is 
completely eliminated. 

Another way of interpreting the results in (fT9l)-(l20l) is that for this code, the minimum 
BER is obtained when the original rate 1/2 code is punctured (giving a rate R = 3/4) and 
transmitted with a 4-ary constellation (with ai ~ 0.45), and a puncturing pattern given by 
P = [[1,0]^, [1, 1]^, [0, 1]^], where following the notation of Iil9il . the columns of P have a 
meaning of time and a denotes a puncture. 

An intuitive explanation of the previous results is the following. In a coded modulation system, 
and for high SNR values, there is a trade-off between the minimum Euclidian distance of the 
constellation and the minimum hamming distance of the code. By puncturing this code, its 
minimum Hamming distance w^^'^'^ = 5 will decrease. On the other hand, by reducing the 
constellation size (from 8-ary to 4-ary), the minimum Euclidian distance increases. For this 
particular code, the improvement due to an increased minimum Euclidian distance is larger than 
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the degradation due to a decrease minimum Hamming distance, and thus, the optimal solution 
is given by (fT9l)-(l20l). 

In Fig. U\ we present the results obtained using the proposed system based on the optimum 
parameters in (fT9l)-(|20l) and we compare them against four different BICM designs. The first 
one is the BICM-S system of [|2l with an 8-PAM constellation and the second one the BICM 
system with R-MUX and an 8-PAM constellation of [4^. The other two will be explained below. 
When comparing the proposed system and the BICM-S system of [2], gains of about 3 dB are 
observed for a BER target of 10^^. The gains compared to the system in flU are about 2.75 dB. 

The performance difference between the proposed system and the BICM system with R- 
MUX and an 8-PAM constellation of O are quite large (2.75 dB). However, the comparison is 
somehow unfair since our system allows HPAM constellations while the results for the system in 
JH are given for an 8-PAM constellation. To make a more fair comparison, we have optimized 
the constellation for the system with R-MUX of Hi, i.e., we selected the optimum constellation 
for each average SNRu- The obtained results are shown in Fig. U\ The performance of the system 
in BH improves, however, the proposed system still outperforms the R-MUX HPAM in 1.7 dB 
at a BER of 10^^. This can be explained by the fact that in the proposed system exploits the 
temporal behavior of the coded sequence in a deterministic way (via the D-MUX), contrary to 
the R-MUX of iH, which assigns the coded bits to different interleavers in a pseudo-random 
fashion. 

Finally, and motivated by the results in (fT9l)-(l20l). we analyze a BICM-S system, a 4-PAM 
constellation, and a puncturing defined by P that gives a rate R = 3/4. This configuration 
is simply another way of obtaining an spectral efficiency of 1.5 bit/dimension using the same 
encoder and decoder. We performed an exhaustive search over puncturing patterns and found 
that for the SNR of interest, the optimal puncturing pattern is the one given by P. The results 
are shown in Fig. Ul In this case, and for a target BER of 10~®, the proposed system still offers 
gains of about 0.4 dB. 

Now we turn our attention to Nakagami-m fading channels. Since the UB in Theorem [3] 

^The optimum R-MUX for this code is such that 2/3 of the coded bits form the second encoder's output are sent to fc = 1, 
1/3 of the bits from the first and the second encoder's outpus are sent to fc = 2, and 2/3 of the bits from the first encoder's 
output to fc = 3. 

'We obtained 01(7) = 0.49 for all 7 [dB] G {10, 11, ... , 16}. 
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depends on K, (01(7), 02 (7)), m, and 7, in general, the optimization must be done jointly 
over all these parameters. However, we have observed that for a given value of m, the optimal 
constellation and D-MUX do not change significantly for the SNR range of interest. Motivated 
by this observation, we have found the optimal constellation for an average SNR that gives a 



BER of approximately 10 ^, 
The obtained values are 



and we have used these values for all the range of average SNR. 



K 



m=l 



K 



m=2 



and 



(2.1) (3,1) (3,2) 

(2.2) (1,1) (1,2)^ 

(at(7),a2(7))U=i 

K(7),«2(7))|m=2 

(aI(7),«2(7))U=5 



K 



m=5 



(1.1) (2,1) (3,1) 

(3.2) (2,2) (1,2) 



(21) 



(0.48,0.20) 
(0.47,0.17) 
(0.42,0.01). 



(22) 
(23) 
(24) 



We note that by selecting one set of parameters for the range of average SNR and a given m 
is relevant from a practical point of view. This is simply because in practice it would be more 
difficult to change the constellation parameters and the MUX for each value of 7. 

In Fig. [8] we present the simulated BER obtained by HQAM-BICM using dlD-lEll) and the 
UB with optimized parameters for each SNR. We compare the results obtained by the proposed 
system against two previous BICM designs (as in Fig. (6]): the BICM-S system of [[21 with an 
8-R\M constellation and the BICM system with an R-MUX in From this figure we observe 
that in fading channels, the proposed HQAM-BICM again outperforms previous BICM designs. 
When compared to BICM-S, the proposed system offers gains up to 2 dB for m = 5 and a BER 
target of 10^''. The gains compared to the configuration in H are less than when compared to 
BICM-S, but still quite large. This figure also shows that the achievable gains increase when the 
fading is less severe (m increases). 



VI. Conclusions 

In this paper we proposed and studied a new BICM transmission framework that uses HQAM 
constellations in conjunction with a dterministic bit- level multiplexer and M-interleavers. It was 
shown that a number of degrees of freedom can be exploited, which in turn gives performance 
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improvements of a few decibels compared to previous BICM designs. The gains were shown to 
depend on the fading parameter, the BER target, and the spectral efficiency, and in general, they 
increase when the fading is less severe. 

There are a number of degrees of freedom that can be exploited in BICM transmission which 
may improve its performance even further. In particular, in this paper we only studied HQAM 
constellations labeled by the BRGC. The performance of BICM with other binary labelings and 
fully asymmetric constellation is still unknown. Moreover, the period of the MUX gives another 
degree of freedom not fully exploited in this paper (only short periods were considered). 

Appendix A 
Proof of Theorem □ 

The Laplace transform of the PDF of the decision variable D{w) in (fT4l) is 



^D^^){s--i) = \[[^LMl)T\ (25) 

k=l 

and its cumulant transform kd{w){s'i1) is 

f^D{w) {s; 7) = log[$D(«;) (s; 7)] = ^Wk log [$Lfc {s; 7)] • (26) 



k=l 



The PEP can be approximated using the saddlepoint approximation [|3TI as 

PEP(tt;;7)^ ^ _ = exp(/tj,(^)(g; 7)), (27) 

sy27r/€';5(^)(s;7) 

where 7) is the second derivative of the cummulant generating function evaluated at 

the saddlepoint s and can be expressed as 



k=l 



(28) 



Using (|26l) and (1281) in (|27|) completes the proof. 
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Appendix B 
Proof of Theorem [2] 

The two-sided Laplace transform of PLj.(A; 7) in (fTTI) . can be written as 

i=o 

= Yl ^k,jew {f^k,jl{s^ - s)) , 
i=o 

where to pass from (l29l) to (l30l) we used the transform pair A/'(A; /i, 2/i) <(=^ exp(/i(s^ ~ s)). The 
saddlepoint can be found by solving $'^^(s;7) = 0, which from (|30l) gives s = 1/2. 

From (|30l ). the second derivative of $Lj.(s;7) with respect to s at the saddlepoint (s = 1/2) 



(29) 
(30) 



IS 



*L,(V2;7)= Y 2efc,i/^fcj7exp(-/ifej7/4). 
Substituting dSO]) and (HB in (EH), the PEP over the AWGN channel is 



(31) 



PEP(i«;;7) 



E!=o ^ -^fcj/^fcj exp (-/ifcj7/4) 



1-1/2 



fc=i E,=o Cfc,jexp(-/Xfcj7/4) 



n 

k=l 



^ ^fcjexp(-/ifcj7/4) 



(32) 



Using (|32l) in (fT2l) completes the proof. 



Appendix C 
Proof of Theorem [3] 

Due to the linearity property of the Laplace transform, we have 

/>oo 

Jo 

Using ^ and & in ([33]), we obtain 

m 



^ \m-^Hk,j{s^ - s)J 



(33) 



(34) 



From (|34|) . the saddlepoint is s = 1/2, and the second derivative of $Lfe(s; 7) with respect to s 
at saddlepoint can be written as 

4m 



Affe-l 

$'^^.(1/2; 7)= Y 26,.^,. 
i=o 



4m + 7/ife^, 



(m+l) 



(35) 
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Substituting (|34l ) and (1351 ) in (fT6l) . the PEP over Nakagami-m fading is 



PEP(iy;7) 



7r7 2^u;fe 



4m 



(m.+l) 



fc=l 



Using (|36l) in (fT2l) completes the proof. 



A4-1 / 4m 

j=0 "Sfc J 4m+7^tfcj 



-1/2 



n 



fc=i L i=o 



4m 



4m + 



(36) 
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Fig. 1. Model of HQAM-BICM transmission: a channel encoder followed by the multiplexer (D-MUX), the interleavers 
(tti , . . . , TTq), the hierarchical M-PAM mapper, the channel, and the processing blocks at the receiver's side. 
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Fig. 2. HPAM (M = 8) constellation labeled with the BRGC. 
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Fig. 3. Constellation parameters for an HPAM constellation with Af = 8. The shadowed region shows the values of (qi, 02) 
that give a BRGC-labeled constellation and the 13 filled squared some particular constellations (not at scale). Any point outside 
this region conesponds to a non-BRGC constellation. 
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Fig. 4. UB versus the constellation parameter qi for the AWGN channel (left) and for Nakagami-m fading channels (right) 
for different D-MUX configurations and different values of 7. The UBs are given by Theorem |2] and Theorem |3] respectively. 
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- [dB] tti 

Fig. 5. The optimal constellation parameter ai(7) versus average SNR 7 for m = 1, m = 5, m = 20, and for the AWGN 
channel (left) and the UB versus ai for 7 = 9 dB and m = 2, m = 7, m = 20 and the AWGN channel (right). The UBs are 
given by Theorem |2] and Theorem [5] respectively. 
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I I 

'lYadilional BICM-S (HS-QAM) |i2|l 1- 




7 [dB] 

Fig. 6. UBs (lines) for n = g = 2 (1 bit/dimension), for Nakagami-m fading channels and the AWGN channel given by 
Theorem 12] and Theorem[3] respectively. Numerical simulations are also included (markers). The proposed HQAM-BICM system 
uses the optimal K*{j) = JsT'*' and 01(7) shown in Fig. |5] (left). The BICM-S system of |2] with a 4-PAM constellation and 
the BICM system with R-MUX and a 4-PAM constellation of f4l are shown for comparison. 
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Fig. 7. UB (lines) for n — 2 and q — 3 (1.5 bit/dimension) for the AWGN channel given by Theorem |2l The simulation 
results are shown with markers. The proposed HQAM-BICM system uses the optimized parameters in (|19|l-ll20b. The BICM-S 
system of |2| with an 8-PAM constellation and the BICM system with R-MUX and an 8-PAM constellation of f4l are shown 
for comparison. The R-MUX system of j4| with optimized constellation and the BICM-S system with a (punctured) code with 
R = 3/4: are also shown. 
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7 [dB] 

Fig. 8. UB (lines) for n = 2 and q — 3 (1.5 bit/dimension) for Nakagami-m fading channels given by Theorem [3] for 
m — 1,2,5. Numerical simulations are also included (markers). The simulations for the proposed HQAM-BICM system uses 
the values in ll2ll-ll24l. The BICM-S system of ||2| with an 8-PAM constellation and the BICM system with R-MUX and an 
8-PAM constellation of 0| are shown for comparison. 
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TABLE I 

Values of fikj for HPAM with M = 8 in 





Ml,3 


M2j 


M3j 


Xo 


+4d? 


+4di 


+4di 


Xl 


+4(di - d-if 


+4(d2 - d'if 


-4di 


X2 


+4(di - d2f 


-4(d2 - da)'' 


-4di 


X3 


+4(di - d2 - dsf 


-4di 


+4di 


Xa 


^4(di - da - d-if 


-4di 


+4di 


Xh 


-4(di - d2f 


-4(d2 - da)" 


-4di 


Xa 


-4(di - d3)2 


+4(d2 - dzf 


-4di 


X7 


-4d? 


+4di 


+4di 
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